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Abstract 

It is shown that, on a manifold with a Finsler metric of Lorentzian signature, the 
lightlike geodesies satisfy the following variational principle. Among all lightlike curves 
from a point q (emission event) to a timelike curve 7 (worldline of receiver), the lightlike 
geodesies make the arrival time stationary. Here "arrival time" refers to a parametriza- 
tion of the timelike curve 7. This variational principle can be applied (i) to the vacuum 
light rays in an alternative spacetime theory, based on Finsler geometry, and (ii) to 
light rays in an anisotropic non-dispersive medium with a general-relativistic spacetime 
as background. 

1 Introduction 

The versions of Fermat's principle that can be found in standard text-books refer to stationary 
situations, both in general relativity (see e.g. Landau and Lifshitz jTj) and in ordinary optics 
(see e.g. Kline and Kay Pj). The goal is to determine the path of a light ray from one point 
in space to another point in space, under the influence of a time-independent gravitational 
field or a time-independent optical medium. A basic idea of how to generalize these standard 
versions of Fermat's principle to non- stationary situations is due to Kovner [3]. He considered 
an arbitrary spacetime in the sense of general relativity, i.e., a manifold with a pseudo- 
Riemannian metric of Lorentzian signature that need not be stationary. He fixed a point 
(emission event) and a timelike curve (worldline of receiver) in this spacetime. The variational 
principle is to find, among all lightlike curves from the point to the timelike curve, those which 
make the arrival time stationary. Here "arrival time" refers to an arbitrary parametrization 
of the timelike curve. It was proven in |lj that the solution curves of this variational principle 
are, indeed, precisely the lightlike geodesies. Kovner's variational principle can be viewed as 
a general-relativistic Fermat principle for light rays that are influenced by an arbitrarily time- 
dependent gravitational field with no optical medium. For some applications one has to use 
the time-reversed version of this variational principle, which is mathematically completely 
equivalent. Under time reversion an emission event turns into a reception event, and the 
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worldline of a receiver turns into the worldline of an emitter, so each solution to the variational 
principle corresponds to an image of the emitter that is seen at the chosen reception event. 
In this time-reversed version, Kovner's variational principle can be used for investigating 
gravitational lensing situations, see Kovner's original work and, e.g., [SIEIIZI- 

Without mathematical modifications, Kovner's version of Fermat's principle also applies 
to the case that, in addition to the (time-depedent) gravitational field, there is a (time- 
dependent) isotropic non- dispersive optical medium. Such a medium can be characterized by 
an index of refraction that depends on the spacetime-point but neither on spatial direction 
nor on frequency. It was observed already in 1923 by Gordon jH] that the light rays in such 
a medium are the lightlike geodesies of a pseudo-Riemannian metric of Lorentzian signature 
which is called the optical metric. So all one has to do in order to apply Kovner's variational 
principle to this situation is to replace the spacetime metric with the optical metric. 

In this paper it is our goal to generalize Kovner's version of Fermat's principle to the case 
that the light rays are the lightlike geodesies of a Finsler metric, rather than of a pseudo- 
Riemannian metric, of Lorentzian signature. There are two physical motivations for such a 
generalization. 

First, the theory of Finsler metrics is often considered as an alternative spacetime theory 
which modifies general relativity by allowing for the possibility that the vacuum is spatially 
anisotropic, even in infinitesimally small neighborhoods. Although up to now there is no 
observational evidence in this direction, such a modification of general relativity has found a 
lot of interest among theorists. An extensive list of the pre-1985 literature can be found in 
Asanov's book on the subject jH]. The variational principle to be established in this paper 
applies to light rays in such a modified spacetime theory, without a medium, and may be 
used, e.g., as a tool for investigating (hypothetical) Finsler gravitational lenses. 

Second, Finsler metrics naturally appear in optics of anisotropic non-dispersive media. 
More precisely, if one performs, on a spacetime in the sense of general relativity, the passage 
from Maxwell's equations to ray optics in a medium characterized by a dielectricity tensor 
and a permeability tensor that have to satisfy some regularity conditions, one finds the 
following, see 0, Section 2.5. Corresponding to the fact that such an anisotropic medium 
is birefringent, there are two different Hamiltonians for the light rays. Each of the two 
Hamiltonians is homogeneous of degree two with respect to the momenta. Providing the 
validity of regularity conditions, the second derivative of each Hamiltonian with respect to 
the momenta is non-degenerate and of Lorentzian signature. This means that each of the 
two types of light rays can be characterized as the lightlike geodesies of a Finsler metric of 
Lorentzian signature. In other words, the variational principle to be established in this paper 
applies to light rays in an anisotropic non-dispersive medium that is in arbitrary motion on 
the background of a spacetime in the sense of general relativity. Various earlier versions 
of Fermat's principle are contained as special cases. E.g. if the anisotropic non-dispersive 
optical medium is at rest in an inertial system, we recover the versions of Kline and Kay 
[2 , Section III. 11, and Newcomb JU]; if it is moving with temporarily constant velocity with 
respect to an inertial system, we recover the version of Glinskii ^T]. Moreover, it should 
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be noted that Fermat's principle can be applied not only to light rays but also to sound 
rays. E.g., there are versions of Fermat's principle for sound rays in an anisotropic elastic 
medium that is at rest in an inertial system (see Babich ^21, Epstein and Sniatycki and 
e.g. Cerveny and in a fluid flow that is moving with temporarily constant velocity with 
respect to an inertial system (see Ugincius jTHI)- Our variational principle indicates how these 
results can be generalized to the case of media in arbitrary motion on a general-relativistic 
spacetime. 

There is one earlier version of Fermat's principle for time-dependent anisotropic non- 
dispersive media by Godin and Voronovich ^H]- In contrast to the approach presented here, 
Godin and Voronovich make no use of Finsler geometry, and they do not work in the setting 
of a spacetime manifold with unspecified topology; they rather assume that spacetime is 
a product of 3-dimensional space and a time axis, and they make strong use of this prod- 
uct structure. The relation between the formulation of Godin and Voronovich and the one 
presented here will be clarified in Section IHl below. 

It should be mentioned that even more general versions of Fermat's principle, allowing 
not only for time-dependence and anisotropy but also for dispersion, are already available. 
In jH], Section 7.3, a variational principle in the spirit of Kovner is established for rays that 
are determined by a Hamiltonian function on the cotangent bundle over spacetime that has 
to satisfy only a certain regularity condition. (A similar, though in various technical respects 
different, variational principle was suggested by Voronovich and Godin ^ITj.) One could es- 
tablish the variational principle for lightlike geodesies in Finsler spacetimes by demonstrating 
that it is a special case of the one given in 0, Section 7.3. However, this is technically more 
difficult (and less instructive) than giving the proof directly. Therefore in this paper we 
choose the latter way. 

The paper is organized as follows. In Section |21 we fix our notation. In Section |21 we 
specify our definition of a Finsler spacetime and collect some basic mathematical facts that 
will be needed later. In Section 0] we formulate Fermat's principle for lightlike geodesies in 
Finsler spacetimes as a mathematical theorem. Section El is devoted to the proof of this 
theorem. In Section El we consider some special cases; in particular we demonstrate that, 
under appropriate additional assumptions, our version of Fermat's principle reduces to the 
one of Godin and Voronovich 

2 Notations and conventions 

We denote the tangent space to a manifold M at the point x by T^M. A circle over the T 

o 

indicates that the zero vector is omitted, i.e. T^M := T^M \ {0}. The cotangent space at 
X is denoted by T*M. For the tangent bundle we write TM and for the cotangent bundle 
we write T*M, i.e., TM := U^gM^^M and T*M := U^<.mT*M. Correspondingly we use the 

o o 

notation TM := Ux^mTxM. 

We denote points in an A^-dimensional manifold M by x, points in TM by (x, v) and 
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points in T*M by {x^p). Here x = {x^, . . . ,x^) stands for a coordinate tuple, in an unspec- 
ified local chart, and {x,v) = (x^, . . . ,x'^,v^, . . . ,v^) and {x,p) = (x^, . . . ,x^,pi, . . . ,pj\f) 
stand for coordinate tuples in the induced natural charts. This identification of points with 
coordinate tuples is non-puristic but notationally convenient (and quite common in the lit- 
erature on Finsler structures or, more generally, on Lagrangian and Hamiltonian equations). 
Correspondingly we use the familiar index notation for tensor fields, with Einstein's sum- 
mation convention for latin indices running from 1 to A^, and occasionally for greek indices 
running from 1 to — 1. 

It is to be emphasized that the use of coordinate notation does not mean any restriction 
as to the global topology. E.g., if we write an equality of vector fields, A^s) = -B*(s), along 
a curve parametrized by s, we do not imply that the curve can be covered by a single chart; 
rather, we mean the invariant equation that takes the given form in any chart which covers 
the point with parameter value s. 

3 Definition of Finsler spacetimes 

Finsler geometry was originally introduced as a generalization of Riemannian geometry, i.e., 
for positive definite metrics. This theory of positive definite Finsler metrics is detailed, e.g., in 
the text-book by Rund 18 . The systematic study of indefinite Finsler metrics, in particular 
of Finsler metrics with Lorentzian signature, began with a series of papers in the 1970s by 
John Beem. In the first paper of this series Beem defines indefinite Finsler structures 

o 

in terms of a sufficiently differentiable Lagrangian function L : TM M that is positively 
homogeneous of degree two and whose Hessian is non-degenerate (with the desired signature). 
This is an appropriate definition for the purpose of the present paper, so we will adopt it 
in the following. (For convenience we will require that L is of class C°°, whereas Beem 
allows for a less restrictive differentiability condition.) It should be emphasized that large 
parts of the physical literature on indefinite Finsler metrics is based on weaker notions. E.g., 
Asanov jH] defines Finsler structures in terms of a function F that is defined, as a sufficiently 

o 

differentiable real-valued function, only on an unspecified open subset of TM; the elements 
of this subset are called the "admissible vectors" by Asanov. On this subset, F is assumed 
to be strictly positive and positively homogeneous of degree one, and the Hessian of F"^ is 
assumed to be non-degenerate (with the desired signature). Clearly, this definition is weaker 
than Beem's. If L satisfies the assumptions of Beem, F := V±L satisfies the assumptions 
of Asanov, with the admissible vectors given as the set on which ±L is positive. (One has 
to choose the plus or the minus sign, depending on the choice of signature.) The reason for 
us to use Beem's definition, rather than Asanov's, is the following. On the basis of Beem's 
definition we can define lightlike vectors as those vectors on which L takes the value zero, 
and we can define lightlike geodesies as those solutions of the Euler-Lagrange equation whose 
tangent vectors are lightlike. On the basis of Asanov's definition, the notion of lightlike 
vectors cannot be defined (in an invariant way), because F"^ is non-zero on the admissible 
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vectors and nothing is said about the extendabihty of F"^ beyond the admissible vectors. As 
a consequence, there is no (observer-independent) notion of hght rays in Asanov's setting, so 
the question of whether hght rays satisfy a variational principle cannot even be formulated. 
For this reason, Asanov's definition is too weak for the purpose of the present paper. 
Therefore, we adopt as the basis for our discussion the following definition. 

Definition 3.1. An A^-dimensional Finsler spacetime is a pair {M,L) where 

(a) M is an A^- dimensional real second- count able and Hausdorff manifold. 

o 

(b) L : TM — > M is a C°° function that satisfies the following conditions. 

( i ) L{x, ■) is positively homogeneous of degree two, 

L{x,kv) = L{x,v) for all A; e ]0, oo[ ; (1) 

(ii) the Finsler metric 

Q-(xv)-=^^^ (2) 

is non-degenerate and of Lorentzian signature (+, ...,+,—). 

We call L the Finsler Lagrangian henceforth. It is a standard exercise to check that 
condition (b) of Definition 13.11 implies the identities 



dL{x,v, . 



dL{x, v] 



v" = 2L{x,v) , (3) 
= 9ij{x,v)v^ , (4) 



L{x,v) = ^gij{x,v)v'v^ . (5) 

We call a Finsler spacetime isotropic at the point x if the Finsler metric gij{x,v) is 
independent of f . If this is true for all pseudo-Riemannian metric of Lorentzian 

signature, i.e., it can be interpreted, in the case dim(M) = 4, as the spacetime metric in 
the sense of general relativity (but also as the optical metric in an isotropic medium on a 

o 

general-relativistic background). In this case, the set {{x,v) G T^-M | L{x,v) < O} has two 

o 

connected components for every x G M; similarly, the boundary of this set in T^-M has 
two connected components, a "future light cone" and a "past light cone". In an arbitrary 

o 

Finsler spacetime, however, the set {(a;,f) G T^M I L{x,v) < 0} may have arbitrarily many 
connected components; correspondingly, there may be arbitrarily many "light cones" . Finsler 
spacetimes with two or more future light cones at each point are probably not of physical 
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interest. (Note that a birefringent medium is not described by one Finsler structure with two 
future hght cones, but rather by two Finsler structures with one future hght cone for each, 
see Example 13 . 31 below. ) However, there is no mathematical reason to exclude them. For the 
formulation of Fermat's principle we will just have to select one light cone, and we will need 
the results stated in the following proposition. Recall that a subset S' of a vector space is 
called convex if kS + {1 — k)S C 5* for all k G ]0, 1[ and that it is called a cone if /cS* C 5* for 
all k e ]0, oo[ . 

Proposition 3.2. Fix a point x in a Finsler spacetime {M,L). Let Z^M be a connected 

o 

component of the set {{x,v) G T^M | L(x,v) < O} and let C^M he the boundary of Z^M in 

o 

TxM . Then the following is true. 

(a) ZxM is an open convex cone in T^M . 

o 

(b) CxM is a cone in T^M and a closed C°° submanifold of codimension one in T^M . 

(c) — — = gij{x,w)w^u^ < for all {x,w) G CxM and {x,u) G Z^M . 

Proof. To prove part (a), it suffices to prove that Z^M is convex because the rest of the 
claim follows directly from the definition. Beem [T^ has shown that, as a consequence of 

o 

Definition 13.11 (b), each connected component of the set {{x,v) G T^M \ L{x,v) < — 1 } 
is convex. As L{x,v) is homogeneous with respect to v, this obviously implies that each 

o 

connected component of the set { {x,v) G T^M \ L{x,v) < —(? } is convex, for every c > 0. 
As any two points in Z^M are contained in such a component, for some c > 0, this proves 
that ZxM is convex. - To prove part (b), we first observe that, by (0]), dL{x,v) / dv"^ has no 

o o 

zeros on T^M, so the set {{x, v) G T^M \ L{x, v) = 0} is a C°° submanifold of codimension 

o 

one in T^M. By definition, CxM is a subset of this submanifold. As CxM is the boundary 

o 

of the open set ZxM in TxM, it must be a connected component, or the union of several 
connected components, of this submanifold, hence it is a cone in T^M and a closed C°° 

o 

submanifold of TxM. - To prove part (c), we consider {x,u) G ZxM and a sequence of 
vectors {x, Wj) G ZxM such that {x, wj) {x, w) G CxM for J —>■ oo. As ZxM is a convex 
cone, (x, Wj + ku) G ZxM, i.e., L{x, Wj + ku) < 0, for all k G ]0, oo[ . Sending J to infinity, 
we find 

L(x, w + ku) < for all /c G ]0, oo[ . (6) 
On the other hand, Taylor's theorem yields 

L(.,w + ku) = L(.,u.) + + + 0(P) , (7) 
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which can be rewritten, with the help of (j21) and (@]), as 

L{x,w + ku) = L{x,w) + k gij{x,w) u'^ + -k^gij{x,w)u^u^ + 0(^k^) . (8) 

As L{x,w) = by assumption, ^ and (jH)) imply 

1 

gij{x,w)w^ + -kgij{x,w)u' u^ + 0[k'^) < for all A; G ]0, oo[ . (9) 

By considering arbitrarily small fc, we find gij{x, w) -u* < . Assume that gij{x, w) = 
, i.e., that u is perpendicular to the lightlike vector w with respect to the Lorentzian metric 
gij{x, w). As our hypotheses exclude the case that is a multiple of w, this assumption implies 
gij{x, w) > which contradicts 0- We have thus proven that gij{x, w) < . □ 

The non-degeneracy of the Finsler metric guarantees that the Euler-Lagrange equation 

d 9L(A(s),A(s)) dL{Xis),Xis)) ^ 



ds d\i{s) d\^{s) 



(10) 



has a unique solution s ^ \{s) to each initial condition (A(0),A(0)) = (x, f ) G TM. 
The solutions to (fTn|) are called the affinely parametrized geodesies of the Finsler spacetime 
(M, L). The homogeneity of L guarantees that for a geodesic the equation L(A(s), A(s)) = 
holds for all s if it holds for s = 0. Geodesies with this property are called "lightlike". 
When formulating Fermat's principle it is our goal to characterize lightlike geodesies by a 
variational principle. As the affine parameter along a lightlike geodesic has no particular 
physical significance, we may allow for arbitrary reparametrizations. Instead of (|1U|). we then 
get the equation for arbitrarily parametrized geodesies 

d dL{X{s)Ms)) dL{X{s),X) dL{X{s)Ms)) 
ds dX'is) dX^is) ^^^^ dX' 

where w is an unspecified function of the curve parameter. 

Moreover, the non-degeneracy of the Finsler metric implies that the equation 

Pi = gij{x,v)v^ (12) 

o o 

defines a map TM T*M, {x,v) ^— {x,p) that is locally invertible. If this map is even a 

o 

global diffeomorphism, we get a globally well-defined Hamiltonian H : T*M — M by Legendre 
transforming L, 

H{x,p) = v^^^^^ - L{x,v) = L{x,v) , (13) 

where {x,p) and (x, f) are related by (fT^ . H is positively homogeneous of degree two, 
H{x, kp) = k'^H{x,p) for k > 0, and its Hessian d'^H{x,p)/ {dpidpj) is the inverse of gij{x,v), 
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thus non-degenerate with Lorentzian signature. The projections to M of the solutions of 
Hamihon's equations with H = are precisely the affinely parametrized lightlike geodesies. 
So we may work in a Hamiltonian formalism on the cotangent bundle, rather than in a 
Lagrangian formalism on the tangent bundle, whenever we wish to do so. 

Example 3.3. The physically most relevant class of Finsler spacetimes is given by La- 
grangians of the form 

L{x,v) = ^ {i{x,vf - Ui{x)Uj{x)v'v^) (14) 

where 

(a) i{x,kv) = ki{x,v) for /c G ]0, cxd[ , 
d^£(x v)'^ 



there exists a (necessarily unique) vector field U^x) such that W{x)Ui{x) = — 1 and 
d'^i(x,v)'^ , 



These conditions guarantee that, indeed, the Finsler metric 

d'^L{x,v) 1 d'^l{x,vf 
gij[x,v] := ^ . = ^ . Ui(x)Uj(x) 15 

has Lorentzian signature, so the Lagrangian (fT^ defines a Finsler spacetime in the sense of 

____ o 

Definition 13.11 In this special case, the set { {x,v) G T^X | L{x,v) < } has two connected 
components Z^M and Z~M which are mapped onto each other by inversion (x, v) t— > (x, —v). 
If i{x,vY is a quadratic form with respect to v, the Finsler metric is independent of v. 
In the more general case, the cones Z^M and Z~M are anisotropic in the sense that we 
cannot find a faithful representation of the rotation group 0{N — 1), for dim(M) = A^, by 
linear transformations of T^M that leave Z^M or Z~M invariant. - For this example the 
Hamiltonian ()13|) takes the form 

H{x,p) = ^ {h{x,pf - U\x)U'{x)pip,) (16) 

o o 

where h{x,p) = i{x,v), with {x,v) G T^M and {x,p) G T*M related by (fT^ . Hamiltonians 
of the form (jl6p appear naturally if light propagation in a linear dielectric and permeable 
medium on a general-relativistic spacetime is considered, see jSj, eq. (2.73). In general, such 
a medium is birefringent; there are two types of light rays, and each of the two types is 
governed by a Hamiltonian of the form (fT^ . with the same vector field W{x) but different 
functions h{x,p). In this case W{x) is the 4- velocity field of the medium and, for each of 



8 



the two types, the function h{x,p) is built in a fairly complicated way from the spacetime 
metric, the dielectricity tensor and the permeability tensor. In this sense, Finsler spacetimes 
with Lagrangians of the form ()14|) and, equivalently, Hamiltonians of the form ()16|) have 
interesting applications in (general-relativistic) optics in media. They can also be consid- 
ered as alternative spacetime models, generalizing the formalism of general relativity to the 
(hypothetical) situation that the vacuum light rays are determined by anisotropic light cones. 

4 Fermat's principle 

It is now our goal to characterize, in a Finsler spacetime (M, L), the lightlike geodesies from a 
point g to a timelike curve 7 by a variational principle. As the trial curves for this variational 
principle we want to consider all lightlike curves from q to 7. Since, at each point x G M, 
the Finsler light cone may have arbitrarily many components, it will be necessary to restrict 
to those lightlike curves whose tangent vectors, on arrival at 7, belong to the connected 
component of the light cone that is selected by the tangent vector of 7. This leads to the 
following definition. 

Definition 4.1. Choose, in an A^-dimensional Finsler spacetime (M, L), a point g G M and 
a embedding 7 : / — M with L(7(t), 7(t)) < 0, where J is a real interval. For each t G /, 

o 

let Z^(t)M denote the connected component of the set {(7(t),f) G T^(i)M|L(7(t), t>) < O} 
which contains the vector (7(t),7(t)). Define the space of trial curves Cg^^ as the set of all 
maps A : [0, 1] — M with the following properties. 

(a) A(0) = g. 

(b) There is a r(A) G / such that A(l) = 7(r(A)). 

(c) A is lightlike, i.e. 

L(A(s),A(s)) = for all s G [0,1], 
and (A(1), A(1)) lies in the boundary of Z^(^r{\))M. 

By an allowed variation of A G C^^^ we mean a map A : ] — eg j [ ^ [0, 1] — ^ 
(e, s) (-^ A(e, s) such that K{e, ■) G Cg,-y for all e and A(0, ■) = A. 

Part (b) of Definition 14.11 defines the arrival time functional r : Cg,^ I. If we have an 
allowed variation A of A, we can consider the map e ^ r(A(£:, ■)) which maps a real interval 
to a real interval. To link up with the traditional notation of variational calculus, in the 
following we use the symbol 5 for the derivative with respect to e at e = ; e.g., we write 

(17) 

e=0 

The desired version of Fermat's principle can now be formulated as a mathematical the- 
orem in the following way. 



..(A) :=;^ .(A(...))) 
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Theorem 4.2. (Fermat's principle for Finsler spacetimes) A curve A G Cq^j is an 
arbitrarily parametrized geodesic if and only if St{X) = for all allowed variations of A in 



The proof will be given in the next section. 

The statement of Theorem 14.21 can be rephrased in the following way. Among all ways to 
go from g to 7 at the speed of light, as it is determined by the field of light cones selected by 
7 according to part (c) of Definition \4.1\ the light actually chooses those paths that make the 
arrival time stationary. In the isotropic case, i.e., if the Finsler metric is independent of v, 
Theorem 14 . 21 reduces to Kovner's version of Fermat's principle which was proven in jl]. In 
this special case we know from |1] that only local minima and saddles, but no local maxima, 
of the arrival time occur. This result is based on the analysis of conjugate points along the 
respective geodesic. One can formulate a Morse index theorem for this situation and, 
under additional assumptions on the global spacetime structure, even set up a full-fiedged 
Morse theory [21] . It is interesting to investigate whether similar results hold in the general, 
i.e. anisotropic, case of Theorem 14.21 Such an investigation will be postponed to future 
studies because it requires additional preparatory work on the second variational formula. It 
is true that conjugate points are well-defined and their basic properties are well-established 
whenever grangian with non-degenerate Hessian (see, e.g., Morse |j24j. Section 

1.5), so in particular for the geodesies of a Finsler metric with arbitrary signature. However, 
the finer aspects of the theory, in particular the relation between the second variation and 
the number of conjugate points, have not been worked out for indefinite Finsler metrics so 
far. (For positive definite Finsler metrics see Crampin |23] and earlier references given 
therein.) 

5 Proof of Fermat's principle 

We begin with the proof of the 'only if part of Theorem 14.21 which is quite easy. So let 
us assume that A G Cq_^ is a geodesic; as r is invariant under reparametrizations, we may 
assume, without loss of generality, that A is affinely parametrized. As all varied curves are 
lightlike, we have for every allowed variation 



After calculating the (^-differentiation under the integral and integrating by parts this leads 
to 



C, 





I 



d dL{\{s),\{s)) _ dL{X{s),\{s)) 
d^ dX'{s) dX'{s) 



) 



6X' ds 



dL{Xis),X{s)) 
dX^{s) 



6X'{s) 



s=0 



1 



(19) 
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As A is an affinely parametrized geodesic, the bracket under the integral vanishes. From part 
(a) and (b) of Definition 14.11 we find 



6X\0) = , (20) 

6y{l) = f(r(A))MA). (21) 

With dH) this reduces (0 to 

= g,,{X{l)Ml))X^{s)Y{r{X))6r{X) . (22) 

Part (c) of Proposition 13.21 guarantees that gji[X{l), X{1)) A-'(s) 7' (r(A)) 7^ , so we have 
found that, indeed, 5r(A) = 0. 

The proof of the 'if part of Theorem 14. 21 is more involved. We first establish a lemma that 
chararacterizes, along a curve A G Cg,^, the set of variational vector fields 6X{s) = deA{e, s)\s=o 
that come from allowed variations A. 

Lemma 5.1. For X G Cg^-y, a C°° vector field s y-^ (A(s),A(s)) along X is the variational 
vector field of an allowed variation, A = 6X, if and only if 

A%0) = , (23) 
A^(l) is a multiple of Y{t{X)) , (24) 
dL(X(s),X(s)) , 9L(A(s),A(s)) .. 

— / A\s) + ^ \ ^' ^ ^' A\s) = . 25 

dX^is) ^ ^ dX'{s) ^ ' ^ ' 

Proof. Clearly, if A is the variational vector field of an allowed variation, A = 6X, it has to 
satisfy the three conditions; this follows immediately if we apply the variational derivative 
6 to the three conditions (a), (b) and (c) of Definition 14.11 Now let us assume, conversely, 
that we have a vector field A that satisfies the three conditions. It is our goal to construct 
an allowed variation such that A = 6X. We give this construction here only under the 
additional condition that A can be covered by a local coordinate system whose A^-th basis 
vector field d/dx^ is timelike, L{x,d / dx^ {x)) < 0, with d/dx'^ ['y{T{X))) = 7(r(A)). (If 
this condition is violated, which may happen if A has self-intersections, the proof requires 
to cover A with several coordinate patches. The details of this patching procedure, which is 
somewhat awkward although straight-forward, can be carried over from the proof of Lemma 
2 in [4J.) Using this coordinate system, we construct the desired allowed variation A from 
the given A* in the following way. We define the first (A^ — 1) coordinates of A by 

A"(e, s) = A°(s) + e A"(s) , a = 1, . . . , (A^ - 1) . (26) 

With these (A^ — 1) coordinates of A known, the A^-th coordinate of A is to be determined 
by the differential equation 

L{A{e,s),dM^,s)) = (27) 
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and the initial condition 

A^(£,0) = 0. (28) 

For e sufficiently small, this initial value problem has indeed a unique solution s i— > A.^{e, s) 
on the interval [0, 1] which is close to A^. To demonstrate this, we ffist observe that (j27j) 
can be locally solved for dshJ^{e,s), because = gNjix,v)v^ is non-zero by part (c) of 

Proposition 13.21 for x = A{e, s) and v = dsA{e, s). So the initial value problem has a unique 
solution on some interval [0, so[ . For e = 0, the solution exists up to some sq > 1 because the 
curve A exists on this interval. By continuity, for all sufficiently small e the solution exists 
up to the parameter 1. By construction, all curves s ^ A{6, s) satisfy the three conditions 
(a), (b) and (c) of Definition 14.11 so A is, indeed, an allowed variation of A. Finally, we 
have to verify that the variational vector field 6X of this variation coincides with the given 
A. This is obvious from ()26p for the ffist {N — 1) coordinates. It is also true for the iV-th 
coordinate because both vector fields are tangent to the surface L = which, again by part 
(c) of Proposition 13.21 is transverse to the A^-lines of our coordinate system. □ 

We are now ready to prove the 'if part of Theorem 14.21 So assume that at A G Cq^^ the 
condition 6t{\) = holds for all allowed variations. Let s ^ (A(s),i?(s)) be any C°° vector 
field along A that vanishes at the endpoints, B{0) = and -B(l) = 0. We choose a vector 
field s ^ (A(s), U{s)) along A with L(A(s), U{s)) < for all s G [0, 1] and U%1) = f (r(A)). 
(Such a vector field exists because L takes negative values on an open set.) We define a 
function / : [0, 1] — > M by the differential equation 



dL{X{s)Ms)) 



dL{\is),X{s)) 



[b\s) + fis)U%s))+ (29) 



, BXs) + fis)U\s) + fis)U\s) 

oX\s) ^ 

and the initial condition /(O) = 0. Part (c) of Proposition 13.21 guarantees that 

dL{Xis) Xis)) ^^^^^ ^ g^^^X{s)Ms))X^{s)U^{s) ^0, (30) 
oX\s) 

so (Unj can be solved for /(s) and the initial value problem has, indeed, a unique solution. 
If we now define 

A\s) := B%s) + fis)U\s) (31) 

we immediately verify from Lemma (5.11 that it comes from an allowed variation, so we may 
write = (5A*. For this variation we find St{X) = /(I) from condition (b) of Definition 14.11 
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Next we define a function h : [0, 1] — > M by tlie differential equation 

dL[\{s)/\{s)) 



dXHs) 



U\s)h{s) = (32) 



dL{X{s)Ms)) dL{X{s)Ms)) _ d ^ dL{X{s)Ms)) 

dX'is) ^ ^ dX^is) ^ ^ ds\ dX'is) ^ ^ 

and tlie initial condition h{0) = 0. As above for the function /, part (c) of Proposition 13.21 
guarantees that this initial value problem has a unique solution. By multiplying (jSH)) with 
the integrating factor e^*-*-* we get 

9L(A(s),A(s)) dL{Xis),X) ^ .^^^9L(A(s),A(s)) 



e'^^^^ ^ V' ^ ' + h{s) ' ' r ' " B\s) = (33) 

yds dX\s) dX^{s) ^ ^ dX' ^ 

ds V dX-{s) ^ ^ ' ^ ' ^ 

Integration of this equation from to 1 yields, owing to the boundary conditions -B*(0) = 0, 
S'(l) = 0, /(O) = 0, and /(I) = ^r(A): 

yds dX^s) dX^is) ^ ^ dX' J ^ ' ^ ^ 

,,,, 9L(A(1),A(1)) ^ , ^ 
5A*(1) 

By hypothesis, 5t(A) = 0; so we have demonstrated that the left-hand side of ()34p is zero 
for any 5* that vanishes at both end-points. Hence, the fundamental lemma of variational 
calculus implies that the bracket under the integral is zero, i.e., that A satisfies which is 
the defining equation for an arbitrarily parametrized geodesic. 



6 Some special cases 

A major difference of our variational principle, in comparison to standard variational prin- 
ciples with relevance to physics, is in the fact that the functional to be varied, that is the 
arrival time, is not given as an integral over the trial curves. In this section we will specialize 
to situations where it is indeed possible to rewrite the arrival time as such an integral. This 
will bring our variational principle closer to the standard literature on variational calculus 
and, at the same time, it will clarify the relation of our variational principle to some earlier 
versions of Fermat's principle. 

To that end we specialize to the case that our A^-dimensional Finsler spacetime (M, L) 
can be covered by a single chart in which the A^th coordinate vector field d/dx'^ is timelike. 
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L{x^d/dx^ {x)). (Actually, it would be sufficient for the following reasoning to assume that 
M is an open subset of a fiber bundle, with timelike fibers diffeomorphic to M. However, for 
notational convenience we will restrict to the more special case.) We can then consider our 
variational principle for the case that q is an arbitrary event in M and 7 is an integral curve 
of d/dx^ , i.e. 7 = d/dx^ o 7. Now along each trial curve A for our varitional principle the 
equation 

L{X{s),t{s),X{s),dt{s)/ds) = (35) 

holds, where we have written 

A(.) = (Ai(.),...,A^-i(s)), tis) = A^(s) . (36) 
Proposition 13.21 (c) guarantees that can be solved for dt{s)/ds along every trial curve, 

^ =/(A(.),t(.),A(.)), (37) 

which defines a function /. Now, owing to the fact that 7 is an integral curve of d/dx'^ , the 
arrival time is the same as the travel time measured in terms of the coordinate x'^ , up to a 
number that is the same for all trial curves, 

r(A) = [ ^^^^ds + constant . (38) 
Jo ds 

Hence, by fj37|) our variational principle takes the form 

f{X{s),t{s),X{s))ds = 0. (39) 

Jo 

This is precisely the variational principle of Godin and Voronovich ^1^. (In the case of an 
isotropic Finsler spacetime it reduces to the variational principle given in Theorem 3 of [3|.) 
We have thus shown that this variational principle of Godin and Voronovich is a special case 
of our version of Fermat's principle, formulated in Theorem 14.21 This special case differs in 
two respects from the more general version of Theorem 14.21 First, the variational functional 
is now written as an integral. Second, the trial curves are now curves A in space, rather than 
in spacetime; each curve A starts at the spatial point to which the spacetime point q projects 
and terminates at the spatial point to which the spacetime curve 7 projects. So it is a purely 
spatial variational principle for curves between two fixed points. However, in the integrand 
of (j39p the function t{s) appears. This function has to be determined, for each trial curve 
A, by solving the differential equation with the initial condition t(0) = to, where to is 
the x^ coordinate of the spacetime point q. Only after t{s) has been determined for each 
trial curve can the variational principle ()39|) be set into action. (Trial curves for which t{s) 
is not defined on the whole interval [0, 1] have to be discarded.) In most cases, determining 
t{s) is very awkward if not impossible; so it is usually recommendable to stick with the more 
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general spacetime version of our variational principle, as given in Theorem 14 ■2[ rather than 
to switch to the more special spatial version of ()39|) . even in cases where the latter holds true. 

There is one situation, however, in which the spatial version is indeed much more conve- 
nient, namely if there is a function e'^^^''"^ such that 

^(e^^(-)L(x,.)) =0. (40) 

In this case, we call d/dx^ a generalized conformal Killing vector field. (If fl40|) holds with a 
function h that is independent of v, d/dx^ is called a conformal Killing vector field, and if 
()4()|1 holds with h identically equal to zero, d/dx'^ is called a Killing vector field.) Then the 
function / of ()37|) is independent of t{s), i.e., the variational principle ()39p takes the form 

6 f f{X{s),X{s))ds = 0. (41) 
Jo 

This is a purely spatial variational principle that does not involve the necessity to solve 
additional differential equations. It is easy to verify that the homogeneity of L implies that 
/ is positively homogeneous of degree one, 

f{x,kv) = kf{x,v) for A; G ]0,oo[, (42) 

so the functional in 1)4111 is invariant under reparametrization. If we add the assumption that 
the Hessian of / with respect to the (purely spatial) velocity coordinates is positive definite, 
()41|1 is equivalent to varying the length functional of a positive definite Finsler metric; then 
the solution curves A are, of course, the geodesies of this positive definite Finsler metric. The 
variational principle (PT|) is of the same form as the time-independent versions of Fermat's 
principle that have been discussed, for light rays in anisotropic media, in [21 El E] and, for 
sound rays in anisotropic media, in [T21 UHl UHl [HI ■ 

This construction also works the other way round. We can start with a function f{x, v) 
that satisfies the homogeneity condition ()42|) and has a positive definite Hessian with respect 
to the velocity coordinates. We can then define a spacetime Lagrangian 

L{x,x'',v,v'') = \{fix,vr - (vy) . (43) 

L gives us a Finsler spacetime for which d/dx'^ is a Killing vector field. (Note that this 
is a special case of the Lagrangian considered in Example 13.31 ) Our version of Fermat's 
principle says that the lightlike geodesies of this Finsler spacetime project to the geodesies 
of the positive definite spatial Finsler structure given by /. Thus, our variational principle 
encompasses, in a spacetime formulation, all time-independent versions of Fermat's principle 
where the spatial rays are the geodesies of a positive definite spatial Finsler structure. 
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7 Outlook 



The Fermat principle in Finsler spacetimes presented in this paper is a satisfactory formula- 
tion for rays in time-dependent anisotropic situations, as long as dispersion does not occur. 
It conveniently comprises many earlier versions in a geometrical spacetime setting. However, 
some questions are still open. 

As already mentioned at the end of Section HI the second variation formula for our varia- 
tional principle has not been evaluated so far. This is of relevance to the question of whether 
a solution curve is a local minimum, a local maximum or a saddle of the arrival time func- 
tional. It would be desirable to investigate whether the index of the second variation is 
related to the number of conjugate points, in analogy to the Morse index theorem of the 
pseudo-Riemannian case. 

There are two more technical generalizations of our Fermat principle which have not 
been worked out so far. First, in this paper we have restricted to Finsler spacetimes of 
class C°°, and we have formulated Fermat 's principle for trial curves of class C°°. For some 
applications it might be recommendable to consider piecewise smooth Finsler structures and 
piecewise smooth ("zig-zag") trial curves. We have not done this here because it makes the 
proof considerably more cumbersome. Second, it is likely that the non- degeneracy of the 
Finsler metric could be a little bit relaxed. E.g., the Lagrangian 

L : TM^ ^ M, (44) 
{x,v) ^ ^/ {v^Y + ■ ■ ■ + {v^-^Y - {v^'f 

violates condition (b) (ii) of Definition 13.11 because the Finsler metric degenerates on the 
v'^-axis, for each = 1, . . . , iV — 1; hence, this case is not within the class of Lagrangians 
for which we have proven Fermat's principle in this paper. However, as the set of lightlike 
vectors for which the non- degeneracy condition is violated is a set of measure zero, it might 
be possible to show by a continuity argument that Fermat's principle is still valid in this case 
and in similar cases. 

Finally, it should be stressed again that the formulation of ray propagation in terms of 
Finsler geometry excludes dispersion, i.e., it does not apply to cases where the propagation of 
rays depends on frequency. If rays are derived from a Hamiltonian on the cotangent bundle 
over spacetime, dispersion is absent whenever the Hamiltonian is positively homogeneous 
(see, e.g., jH], p. 116), as is inherent in Finsler geometry. Therefore, any formulation of 
ray propagation that includes dispersion has to leave the domain of Finsler geometry. It 
was already mentioned that a version of Fermat's principle allowing for time-dependence, 
anisotropy and dispersion was brought forward in [Hj, Section 7.3 (and that another such 
version was suggested by Voronovich and Godin ^7]). The problem with this version is 
that it is a variational principle for curves in the cotangent bundle over spacetime, not for 
curves in spacetime. The condition under which it can be reduced to a variational principle 
for curves in spacetime is given in jH], Section 7.3. However, this is only a statement on 
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existence; even if one has verified, for a particular case, that this condition holds true, it is 
not obvious how to get an explicit formulation of the variational principle in terms of curves 
in spacetime. E.g., such an explicit formulation was worked out for rays in a non- magnetized 
plasma, which is an example of a dispersive and isotropic medium, in [2^. On the other 
hand, for a magnetized plasma, which is an example of a dispersive and anisotropic medium, 
such a formulation does not exist so far. In view of applications to astrophysics, this is the 
most interesting case for which a spacetime formulation of Fermat's principle in the spirit of 
Kovner, allowing for arbitrary time-dependence, is still to be worked out. 
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